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We address a central (and classical) issue in the theory of infinite games: the reduction of the memory 
size that is needed to implement winning strategies in regular infinite games (i.e., controllers that 
ensure correct behavior against actions of the environment, when the specification is a regular O)- 
language). We propose an approach which attacks this problem before the construction of a strategy, 
by first reducing the game graph that is obtained from the specification. For the cases of specifications 
represented by "request-response"-requirements and general "fairness" conditions, we show that an 
exponential gain in the size of memory is possible. 

1 Introduction 

Infinite games are a tool for the construction and verification of reactive systems. We consider the case 
of two players, Player modeling a controller and Player 1 its environment. We deal with finite arenas 
and regular winning conditions, the latter one being captured by standard automata theoretic acceptance 
conditions (for example Biichi or Muller conditions). For these types of games the winner is computable 
and a finite-state winning strategy can be constructed |[ni6][T3ll5]|. 

There are many criteria for measuring the quality of a winning strategy. If only a finite memory is 
needed, then we mostly consider the size of this memory. This view has been pursued in many papers, 
among them [2]. For example, it is known that weak and strong Muller games over a graph with n 
vertices can be solved with winning strategies of size at most 0'{2") and ff{n\), respectively. There are 
well-known examples that show the optimality of these bounds 

A standard method for the construction of winning strategies is to proceed in two steps. In a first 
step, the game graph G is expanded by a memory structure S, yielding a larger game graph G' (loosely 
indicated as G' = 5 x G), while the original winning condition cp is transformed into a simpler one 
{(p'), allowing for positional winning strategies over G'. In the case of weak Muller games the memory 
structure is the powerset of the set of all vertices of G, whereas in strong Muller games we consider 
the set of all sequences of vertices of G. From a positional winning strategy over G' one immediately 
obtains a finite-state winning strategy over G with memory S. A reduction of the memory size can then 
be performed by classical minimization algorithms for sequential functions (as they are computed by 
Mealy automata). 

In this paper we pursue an alternative approach that addresses the aspect of memory reduction at an 
earlier stage, namely before the construction of a positional winning strategy over G' . More precisely, 
we insert an intermediate step of reducing G' , viewing it as an acceptor £^ of an G)-language, where the 
winning condition (p' is used as acceptance condition. This reduction yields a smaller graph G'q with 
memory structure 5o; it has the same type of winning condition as G' . For the graph Gq we construct a 
positional winning strategy, which is subsequently transformed into a finite-state winning strategy with 
memory structure Sq over G. 

The main challenge in this approach is to introduce a method for reducing ft)-automata with accep- 
tance conditions that are known to define games with positional winning strategies. State space reduction 
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Figure 1 : Our Memory Reduction Approach (see Step 2) 



of co-automata is a difficult problem, already for Biichi conditions. In [1] our method was presented for 
the case of weak MuUer games. Reduction of weak parity automata can be done by the method of mini- 
mization presented in [8|. It turned out that the approach can result in an exponential gain, regarding the 
size of the memory needed. 

In the present paper we address the more difficult case of "strong" winning conditions. Whereas 
a weak winning condition merely refers to visits and non-visits to vertices, a strong winning condition 
considers the set of vertices visited infinitely often. We deal with two particular winning conditions of 
practical interest, namely Request-Response and Streett conditions. 

In a Request-Response game the winning condition is a conjunction of statements "whenever a state 
with property p is visited, then sometime later a state with property q". Formally, we are given a set 
D. = . . . , {Pic,Rk)} of pairs of subsets of V. Player wins if every request, i.e., a visit to some 

Pj, is eventually followed by a matching response, i.e., the set Rj is visited sometime later. A Streett 
winning condition is denoted similarly, we call the sets Ej,Fj. Player wins if infinitely many visits to 
Fj imply infinitely many visits to Ej. 

We propose a method to reduce the size of co-automata with the aforementioned types of acceptance 
conditions, using this to reduce the size of game graphs before the construction of winning strategies. 
For the case of Biichi automata we apply the approach of "delayed" simulation presented in [3 J. A state q 
is delayed simulated by a state q' if, in a run from q, each visit to a final state can eventually be answered 
by a visit to a final state in a run from q'. This condition is tested via a simulation game between two 
players. For the parity condition, as obtained by a game simulation of a Streett game, we use an extended 
version of delayed simulation [4]. If in the run from the simulated state a particular color is seen, then in 
the run from the simulating state this color has to be exceeded by a better one (regarding the acceptance 
condition). In our setting, computation of delayed simulation for the Biichi condition can be reduced 
to minimization of standard DFA, whereas for the parity condition we have to solve the corresponding 
simulation game explicitly. 

This work is structured as follows: In the subsequent section we recall the basic terminology and 
known results. Section [3] presents our approach in abstract terms. The main issue here is to reconcile 
the two views of a graph as used for the presentation of an infinite game and for the definition of an co- 
language. Section|4]develops the approach for the case of request-response games and shows an example 
where an exponential gain in the size of the memory needed for implementing a winning strategy is 
obtained. In Section [5] we treat analogously the case of Streett games. 

2 Preliminaries 

An infinite game T = (G, (p) is played by two players. Player and Player 1. The game arena is a finite 
directed graph G = (y,E) with each vertex belonging to either player, i.e., V = Vb W Vi where Vb belongs 
to Player and V\ belongs to Player 1, and E (IV xV . The winning condition (p C V® is the set of all 
infinite paths through G which are winning for Player 0. Starting from an initial vertex the players move 
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a token along edges in E, building up an infinite play p = p (0)p ( 1 )p (2) • • • . If the current vertex belongs 
to Vq, then Player moves the token, and analogously for Player 1. The play p is winning for Player if 
p & (p, otherwise it is winning for Player 1. 

A strategy for Player is a function / defining a next move for every game position of Player 
(analogously for Player 1), i.e., it is a partial function / : V*Vo — )• V such that for every play prefix 
vo---Vk with Vk G Vb it holds {vk,f{vo ■ • -Vjt)) G A play p = p(0)p(l)p(2) • • • is played according to 
f if for all p(/) G Vb it holds p(/+ 1) = /(p(0) ■ • • p(/)). A strategy / is called a winning strategy from v 
for Player if each play starting in v that is played according to / is winning for Player 0. The winning 
region Wq of Player is the set of all vertices from where Player has a winning strategy. Strategies can 
be implemented by I/O-automata in the format of Mealy-automata. 

In this work we deal with Request-Response and Streett games. A request-response winning condi- 
tion is given by a set H = {(Pi . . . , {Pk,Rk)} of pairs of subsets of V. A request is a visit to a set Pj, 
and a response is a visit to a set Rj, for I < j < k. A play p = p(0)p(l)p(2) • • • G V® is winning for 
Player if and only if every request is eventually responded to, i.e., for every j it holds 

V/(p(/) G Pj =^ 3i > i : p(/') G Rj) 

A Streett winning condition is induced by a set H = {{Ei,Fi),. . . , {Ek,Fk)} of pairs of subsets of V. For 
a play p, let Inf(p) be the set of vertices visited infinitely often in p. The play is winning for Player if 
and only if for every pair {Ej,Fj) it holds 

Inf(p) nFy / ^ Inf(p) / 0. 

For the algorithm we are going to introduce we need the notion of game simulation. The key idea of a 
game simulation is to extend the given game graph by a memory component such that on the new game 
graph a simpler winning condition can be used to simulate the original one. Any solution to the extended 
game can be used to compute an I/O-automaton that implements a winning strategy for the original game 
(see for example [ 10|). 

Definition 1. Let F = {G,(p) and F' = {G',(p') be infinite games with game graphs G = {V,E) and 
G' = {V',E') and winning conditions (p, (p'. We say that F is simulated by F' (short: F < F') if and only 
if the following hold: 

1. V' = 5 X y for a finite memory set S (and {s,v) G V/ <;=^ v G V,) 

2. There exists so e S such that every play p of F is transformed into a unique play p' of F' by 

(a) p(0)=v^p'(0) = (^o,v) 

(b) Let{s,v) eV: 

i. {vy)£E^3s' eS: {{s,v),{s',v')) G E' 

ii. {{s,v),{si,vi)) e E',{{s,v),{s2,V2)) G £" =^ 5i = si 

(c) ((^,v),(/,v'))G£'^(v,v')G£ 

3. p is winning for Player in F -4=^ p' is winning for Player in F' 

Later on, we present game simulation algorithms for request -response games by Biichi games and 
for Streett games by parity games [T?,^!. A Biichi winning condition is induced by a set F C V of final 
vertices. A play p is winning for Player if and only if the set F is visited infinitely often: Fnlnf(p) ^ 0. 
We solve a Biichi game as follows: First, we compute the set Recuro(F) Q F consisting of all final 
vertices from where Player can force infinitely many visits to F . Afterwards, we compute the 0- 
Attractor of Recuro(F), i.e., the set of all vertices from where Player can force a visit to Recuro(F). 
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It can be shown that this attractor coincides with the winning region of Player in the Biichi game 
and that an associated attractor winning strategy reduces the distance to F in each move (cf. n^-games 
in [ IQI). A parity winning condition is given by a coloring of the set of vertices, i.e., a function c : 
y — {0, . . . A play p is winning for Player if and only if the maximal color seen infinitely often, 
denoted max(c(Inf(p))), is even. 

All types of games we consider are determined: from each vertex one of the players has a winning 
strategy. In the sequel, we denote a Biichi game {G,F) rather than {G,(p), and analogously for other 
types of games. We assume that the reader is familiar with the basic theory of co-automata (see for 
example HUH). 

3 Reduction of Game Graphs 

The idea of our memory reduction algorithm is to reduce the game graph G' before computing a winning 
strategy. To get this in a formal setting we transform infinite games into w-automata, and vice versa. 
We view the simulating game F' as an ft)-automaton £/ accepting exactly the plays winning for Player 
in r. The automaton ^ is obtained from £/ by state space reduction in such a way that the structural 
properties of game simulation are preserved, i.e., F is simulated by F", where F" is the automaton ^ 
viewed as infinite game. To reduce £/ we compute a language-preserving equivalence relation on the 
memory S. 

Definition 2. Let F = {G,(p) and F' = {G',(p') be infinite games such that F < F'. We define the 
(deterministic) game automaton = {{S x V) U {^0)^sink})'?0)5,i/A, Vb) over V. The function 5 is 
adopted from E' and a transition is labeled by the V-component of its target state. For v' G V we set 
5(^0, v') := (5o,v') and 5(^smk,v') :=^sink- For.? e5,v,v' G V with (v,v') ^£ we set 5{{s,v)y) :=^sink- 
The acceptance condition Y is defined on an abstract level: A run qop' of £/ is defined accepting if and 
only if p' is a winning play for Player in F'. (Conversely, an automaton game is constructed from a 
game automaton in the obvious way. For that we need to keep Vq in jz/.) 

A simulating game and its game automaton are equivalent in the following sense. 
Remark 1. Let F,F' be infinite games such that F < F' and the game automaton of F'. Then, £^ 
accepts exactly the plays winning for Player in F: L{£^) = (p. 

We now reduce the game automaton in such a way that the properties of game simulation are pre- 
served. To retain item[T]from Definition [T] we compute an equivalence relation on 5 x V and refine it 
to on S, where only is used for reduction. Moreover, to achieve item |3] we have to preserve the 
language of £/. We require the following structural properties for 

Definition 3. Let £/ be a. game automaton and let f» be an equivalence relation onS xV. We say that ^ 
is compatible with £^ if and only if the following hold: 

1. For all 51,^2 G 5,v,v' G V: 

(Sl,v) (S2,V) =^ 5{{si,v),v') d{{s2,v),v') 

2. Let p and p' be two runs in (starting at arbitrary states) such that it holds p(/) k, p'{i), for all 
/ G N. Then p is accepting if and only if p' is accepting. 

The quotient automaton of jz/ with respect to k.^ is defined on the basis of the following observation: 
If {s\,v) ^ {s2,v) holds then from these two states exactly the same inputs are accepted. (Note that £/ 
gets as inputs the plays of the game F.) If this is true for all v £V, then si and S2 can be considered 
equivalent. 
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Definition 4. Let £/ he a game automaton and let ?a be a compatible equivalence relation on S xV. The 
equivalence relation on S is defined as follows: 

si '^ssi ■ <^=^ Vv G V : {si,v) ^ (s2,v) 

For s ^ S, [s] denotes the equivalence class of s with respect to ^s- Given si 52 and (v,v') € E, 
let {s'j,v') := 5{{si,v),v') for / = 1,2. According to Definition [3] we know that {s\,v') « ('5'2,v') holds. 
However, s\ ^'2 does not hold necessarily. To get the v'-successor of ([^i], v) well-defined we use some 
fixed total order -<5 on 5. 

Definition 5. Let ^ be compatible and be derived from it as above. We define the quotient automaton 
^/^s = ((-^/^s X ^) W ko,^smk},^o,5/^,,i/^/^,,Vb) over V. Given {[s\,v) G 5/^, x V and (v,v') G £ 
we define 

5/^.((M,v),v'):=([.n.in],v') 

where 

5niin := min{f | 3f : f ss^ 5' and 5((f,v),v') = (f ,v')}. 

The rest of 8/-^ is defined analogously. Let p = ^o(['5'i],vi)([s2],V2) • • • be a run of =2/ /~^,. We define p 
to be accepting if and only if the run p' = qQ{s\,v\){s2,V2) ■ • • oi (which is uniquely determined by p) 
is accepting. 

The run p' is uniquely determined by p because both .sif and /~^, are deterministic. The accep- 
tance condition for £^ /-^ immediately implies L{£if) = L{£^ /~s)- Later on, we show that for reducing 
Biichi game automata and parity game automata there exist compatible equivalence relations which are 
computable efficiently from a game automaton j/. Moreover, the respective quotient automaton jzsf /-^ 
can be defined with the same type of acceptance condition, in both cases. The following theorem shows 
that the automaton game Y" of .s^ /-^ has the same structural properties as T' , i.e., Y is simulated by Y" . 
(For the proof see CJ.) 

Theorem 1 (fVl). Let Y = {G, q)) and Y' = {G' , q)') be infinite games such that Y < Y'. Let £/ be the 
game automaton of Y' and ~ a compatible equivalence relation on S xV. Then Y is simulated by the 
automaton game Y" of si j 

We present the full algorithm for memory reduction. 

Algorithm 1. (MEMORY REDUCTION) 

Input: Infinite game Y = {G, (p) 

Output: Strategy automaton s/f for Player from Wq 

1 . Establish a game simulation of F by a new game F' in which Player has a positional winning 
strategy from Wq (cf. Definition[Tll. 

2. View F' as G)-automaton s/ (cf. Definition |2l). 

3. Reduce G)-automaton s/ : Use a compatible equivalence relation ?a on S x V to compute sa^ on S 
and construct the corresponding quotient game automaton s/ /~s (cf. Definitions I3I5I ). 

4. View automaton game F" = {G", q)") (cf. Definition|2]). 

5. Compute a positional winning strategy for Player in F" and from it construct the strategy au- 
tomaton £/f. 

Algorithm [Udoes not depend on the actual winning condition cp, but we need a suitable relation f» to 
execute step [3] Moreover, Theorem [T]is even valid if F' does not admit positional winning strategies. 
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4 Request-Response Games 

In this section we apply the framework of the preceding section to request-response games. The first 
step is a game simulation by a Biichi game. The idea of this simulation is to memorize the set of open 
requests, and we use a marker (which is cyclically increased) to indicate which request is to be fulfilled 
next. Every time the marker is reset to value 1 we visit a final state. 

Remark 2. Let G = {V,E)hea. game graph and Q. = {{Pi,Ri),. . . ,{Pic,Rk)} a. family of k pairs of subsets 
of V . Then the induced Request-Response game F = {G,Q.) is simulated by a B iichi game V = {G' ,F'). 

Proof. We define the game graph G' = {V',E') and the set F' of final vertices as follows: 

• V := 2{i'-*^} x{l,...,k}x{0,\}xV 

• {{P,i,b,v),{P',i',b',v'))eE':^ 

- p' = (Pu{i\vePi})\{i\veRi} 

_ f / if / G P' 

\ {i mod k) + \ otherwise 

y _ \ 1 if / = ^ and /' = 1 
1^ otherwise 

- (v,vOg£ 

• f .= 2^^ -''^ X {1,...,A:} X {1} X V 

□ 

It is easy to verify (see 113) that the above construction satisfies Definition [T] In the sequel, we 
explain how to compute a compatible equivalence relation for Buchi game automata, using results on 
delayed simulation presented in [3 1. 

4.1 Delayed Simulation for Biichi Automata 

The delayed simulation game ^^g(gO)'?o) ^ Biichi automaton is played by two players. Spoiler 
and Duplicator, and starts at (<70)^o)> where ^Oj^o are arbitrary states of . In the first round Spoiler 
chooses a transition {qo,ao,qi ) G A and Duplicator answers by a transition {q'Q,ao,q\ ) G A with the same 
labeling. From the pair {qi,q[) the game proceeds with the second round, analogously, and so on, until 
infinity. This way. Spoiler and Duplicator build up two infinite paths p = qoqxqi • ■ • and p' = q'Qq\q'2 
respectively. The play {p,p') is winning for Duplicator if and only if 

V%GF^3j>/:^;. GF). 

We say that q^ delayed simulates qo if and only if DupUcator has a winning strategy in Wcie{qo,^o) and 
denote this ^rfe Iq- Moreover, we say that qo,q'o delayed simulate each other, denoted -de q'o, if 
and only if q^ ■<de Qo and q^ qo- Quotienting with respect to preserves the recognized language. 

Lemma 1 (131). Let si he a Biichi automaton. Then it holds L(£/) = L{£/ 

We extend delayed simulation to delayed bisimulation as follows: In each round of the play. Spoiler 
has a free choice whether to take the next transition in either p or p', and Duplicator must take the next 
transition in the other run, afterwards. The winning condition is modified as follows: If a final state is 
seen at position / of either run, then there must exist j > i such that a final state is seen at position j in 
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the other run. If Duplicator wins the corresponding game, then we say that qo^q^ are delayed bisimilar, 
and denote this ^de q'o- 

We make use of the fact that, for deterministic Biichi automata, delayed simulation can be replaced 
by delayed bisimulation, and vice versa. 

Remark 3. Let £/ be a deterministic Biichi automaton. Then, for all states q,q' of £/ it holds 

q-deq <^=^ q^deq- 

Remark [3] follows immediately from the fact that the transitions taken in .s/ are uniquely deter- 
mined by the letters chosen by Spoiler. Hence, it makes no difference whether he chooses the next 
transition in p or p'. To compute we introduce direct bisimulation. It is defined as delayed bisim- 
ulation with the only difference of a modified winning condition in the corresponding game: The play 
{qoqiqi ■ • ■ ,^()<?i<?2 ' • ') winning for Duplicator if and only if 

V%. eF ^ q'.GF). 

If Duplicator has a winning strategy in the direct bisimulation game, starting at (^Oi^o)» then we say that 
qo,q'Q are direct bisimilar, and we denote this 170 ~di q'o- To explain how delayed bisimulation and direct 
bisimulation are connected, we introduce the closure of £/, denoted cl{£/). It has the set F' of final 
states, where we initially set F' := F and iterate the following until a fixed point is reached: 

If there exists q ^ F' such that all successors of q are in F', then put q in F'. 

The automata £/ and c\{£/) are equivalent and, clearly, c\{£/) is deterministic if £/ is deterministic. 
Moreover, note that cl(=e/) can be computed in time linear in The following lemma completes the 
approach to state space reduction of (deterministic) Biichi automata. 

Lemma 2 (lH). Let £/ be a Biichi automaton. For all states q,q' we have 

q ^de q in =2/ <J=^ q ^iji q in cl{£/). 

For a given Biichi game automaton £/ we compute £/ /-^ as follows: We compute the direct bisimu- 
lation relation in c\{£/) which coincides with in £/. Note that for a deterministic Biichi automa- 
ton the computation of r^^, is the same as block partitioning for standard DFA. Hence, the computation 
can be done in time 0'{n\ogn), if n is the number of states of c\{s^) and |r| is assumed constant [91 . As 
a direct consequence, we get that ss^,- is compatible with cl(i</) and ^a^^ (hence, also ~jg) is compatible 
with £^ . From the relation k,^ is computed as given in Definition |4] where « is replaced by 
Finally, we can apply Theorem [T] 

Remark 4. Let be a Biichi game automaton and the delayed simulation relation for . Then 
is compatible with .sif. 

Corollary 1. Let T be a Request-Response game and T' the corresponding Biichi game (cf. Remark^. 
Further, let £/ be the game automaton of T' and defined as above. Then T is simulated by the 
automaton game T" of ^ 

Let us briefly give the memory reduction algorithm for Request-Response games. 

Algorithm 2. (MEMORY REDUCTION FOR Request-Response games) 

Input: Request-Response game T = {G,Q.) 

Output: Strategy automaton =2^ for Player from Wq 

1. Establish a game simulation of F by a Biichi game F' (cf. Remark|2]). 
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2. View r' as Biichi game automaton (of. Definition |2ll. 

3. Compute cl{£/) (of. pagel52l) and tlie direct bisimulation relation in cl(^). By Lemma[2]and 
Remark|3]it coincides with c^^e in -k^- From compute (cf- Definition |4]l. 

4. View £/ /~g as Biichi automaton game T" (cf. Definition |2l). 

5. Compute a positional winning strategy for Player in T" and from it construct si^f. 

We have shown that the delayed simulation relation of a deterministic Biichi automaton can be com- 
puted in time i^{n ■ logw), where n is the number of states and [r| is assumed constant. Here, we get a 
complexity of i?'{n ■ (logn)^), because we have ^(logn) input letters. The Biichi game Y" can be solved 
in time 0'{n^ ■ logn). Hence, the total running time of Algorithm |2]is polynomial in |r'l. 



4.2 An Example for Request-Response Games 

In this subsection we compare the memory size of winning strategies (obtained by Algorithmic with the 
standard approach, where after the conversion of a request- response game into a Biichi game a winning 
strategy is directly computed (and then possibly minimized according to I/O-automata minimization). 
We present an example with an exponential gain in the memory size. 

Consider the game graph Gk, which is shown in Figure |2]for ^ = 3. Let Q. be the following request- 
response winning condition: 

A play proceeds as follows: From the initial vertex v. Player 1 takes k decisions activating either Pj or P'^ 
{i = \ , . . . ,k). At vertex w Player takes over making k decisions himself. In vertex y all pairs from Q. 
are responded to. Hence, each (positional) strategy for Player is winning. 



Pi Pi 

^0 / \ ^ \ / 



^3 



\ / \ / 
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p' 
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Figure 2: Request-Response Game Graph G3 



Theorem 2. Let T/^ = {Gk,^k) be the Request-Response game from Figure^and let T[ = {G'i^,F') be 
the Biichi game simulating T^, constructed as in the proof of Remark^ Then, Player wins T^from v 
such that the following hold: 

1. The positional winning strategy flfor Player Ofrom (0, 1,0, v) in r[ yields a winning strategy f^ 
for Player Ofrom v in of size at least 2^. 

2. The reduced game graph G'[ computed by Algorithm^has only one memory content. 

Proof. If Player precisely mimics the decisions of Player 1, i.e., for all / = 1, ... ,^ she moves to the 
/?(-vertex if and only if Player 1 has moved to the /^-vertex before, then in the Biichi game a final vertex 
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is seen as soon as vertex y is visited for the first time. On the way from vertex w to vertex y, the counter 
in the second component of the memory is increased by one for 2k + I times: It starts with value 1 at 
vertex w and has value 2k + I when reaching vertex x; it is reset to value 1 when the play proceeds from 
vertex x to vertex y. 

If Player makes a mistake, i.e., there exists / such that she moves to the /?;-vertex if and only if 
Player 1 has moved to the /^'-vertex, then a final vertex is reached several moves later than in the case 
where she plays "correctly". This is due to the fact that her false decision at the ith response avoids that 
the counter in the second component of the memory is increased. 

By our remarks above, there is a unique shortest path from vertex w to vertex y which visits a final 
vertex in the Biichi game. It is the path which precisely mimics the path from vertex v to vertex w. 
Solving the Biichi game we obtain an attractor strategy, which means that a final vertex is assumed as 
soon as possible. Hence, the strategy chooses the "correct" path from vertex w to vertex y, and this 
strategy requires a memory of size at least 2*^ because it needs to memorize each of the k decisions of 
Player 1. 

Now, let us consider Algorithm |2] Let V be the Biichi game computed by the game simulation from 
RemarklH and let s := (0, 1, 1,^) £ S xV. Once the play on G reaches vertex y, the set of active pairs 
is emptied, whereby on G' we reach a vertex of the form {0,i,b,y), where I < i < k,b £ M. After at 
most k revisits to vertex y (on G) the value of / is reset to 1, which means that we reach vertex s (on G'). 
Moreover, vertex s is repeatedly visited every 2k + I moves, thereafter. Summing up, every infinite path 
on G' visits vertex s (infinitely often). 

Consider the Biichi game automaton £/ of V. By the remarks above, every infinite path in £/ 
(without ^sink) leads through s, even when starting at non-reachable states. Accordingly, all states in 
5 X V are declared final in cl(i</). Thus, we obtain {s\,v) {s2,v) for all s\,S2 £ S,v £V. Accordingly, 
all memory contents are equivalent, i.e. S/-^ is a singleton. □ 

5 Streett Games 

A Streett winning condition is given by a family of pairs of subsets of V: 



Player wins if and only if, for each j, infinitely many visits to Fj imply infinitely many visits to Ej. 

In a game simulation for Streett games by parity games we keep track of the order of the latest visits 
to the sets Fi,Ei (i = I,. . . ,k). To do so, we use a data structure called Index Appearance Record, short 
lAR |5|. For ^ > 1, we denote the symmetric group of {1, . . . i.e., the set of all its permutations. 

Remark 5. Let G = (V,^) be a game graph and Q. = {(^i . . . , {Ek^Fk)} a family of pairs of subsets 
of V . Then the induced Streett game Y = {G,Q.) is simulated by a parity game V = {G',c'). 

Proof. Let G' = {V',E') be defined as follows. As memory S we use the Index Appearance Record (lAR) 
of V: 



As initial memory content we choose sq := {I ■ ■ - k,!,!). The transition relation E' is uniquely determined 
by E and Q.. We define: 



a = {{Ei,Fi),...,{Ek,Fk)} 



S:=IAR(y) = {(/l•••4,^>, 



f)\{h---ik)£ykA<e,f<k} 



(((/i •••4,e,/),v),((iV--4,e' 

1. {v,v')£E 
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2. (/ J •••/[) is obtained from (/i • • • it) by shifting all // with v G to the left, / G { 1 , . . . , ^} 

3. e' is the maximaQ / G { 1 , . . . , such that v € 

4. /' is the maximal m G {I, . . . ,k} such that v G 

The coloring c' : 1AR{V) x V ^ { 1 , . . . , 2;k} of V is defined by: 

can-ik,ej),v):=^ 2/-1 if.</ 

□ 

We use the right-hand delayed simulation for alternating parity automata (introduced in [4 1) to reduce 
parity game automata. Whereas for Biichi game automata the problem of computing delayed simulation 
can be reduced to the minimization problem for standard DFA (cf. Section ITTI ). for parity game automata 
we have to solve the corresponding simulation game explicitly. It is described in [4J, and we can use a 
simplified version of it. Firstly, a parity game automaton is not alternating which means that the first 
move of each round is made by Spoiler in the simulated automaton and the second move is made by 
Duplicator in the simulating automaton. Due to this fixed order of moving the pebbles we need less 
vertices in the simulation game graph. Secondly, a parity game automaton is deterministic. This means 
that the positions of the two pebbles and the update of the priority memory (see below) are uniquely 
determined by the letter chosen by Spoiler. Hence Duplicator's moves are predetermined by Spoiler's 
moves and, accordingly, all vertices in the simulation game graph belong to Spoiler. Let us define the 
simulation game in a formal way. 

5.1 Right-hand Delayed Simulation for Parity Automata 

We are given a parity game automaton £/ = {{S x V) U {qo,qfimk},Qo,S,c' ,Vo) over V, where a run p 
of £/ is accepting if and only if the maximal color seen infinitely often in p is even. In 141 a min- 
parity condition is assumed. Hence, we have to redefine the coloring of £/ by c' := k — c' for even 
A: G N large enough. We construct the simulation game = {Gf^,(p'/,) as follows: The game graph 
G^Jl = {V^'^,E'/J has the set of vertices V;^' = {S xV) x {S xV) x {c'{S x V) U {/}), where c'{S x V) 
denotes the set of colors assigned by the parity function c'. We set Vsp := V^j^ (and Vpu •= 0)- The edge 
relation E^'^ C VJ^' x VJ^' is defined as follows: 

{{suvi),{s[,v[)) G E', ((52,V2), (4,v^)) e E',v[ = v'^ and 
A'=pm(c'(4,v'i),c'(4,v^),A) 

The priority memory update function pm :NxNx(NU{/})— >NU{/}is defined as follows: 

i. pm(/, j, /) = min{/,7}, if / ^ j 

ii. pm(/,j,/) = /,if 7 ^ / 

iii. pm(/, j,^) = min{/, j,^}, if / ^ j 

iv. pm(/, j, k) = k, if j ■< i, i is odd and / < k, and j is odd ork< j 

V. pm(/, i,k) = / , if i < i, j is even and j < k, and / is even or ^ < / 
vi. pm(/,7,/c) = /, if / is odd, j is even, and both / < k and j <k 
'We assume w.l.o.g. that Ejf = Fj^ = V to have the pointers e' and /' well-defined. 
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vii. else pm{i,j,k) = k 

In the basic definition of the delayed simulation game in (4) the value of pm in case|iv]is set to /. It 
is also shown there that quotienting with respect to the obtained equivalence relation is not language- 
preserving. Hence, we use the slightly modified version from above, where in case|iv]the value of pm is 
set to k. The induced relation is defined on page|56]and preserves the recognized language. (This is also 
shown in [4J.) The binary relation -< is the reward order on N. For m,?i G N, we define m<nif and only 
if 

1. m is even and n is odd, or 

2. m and n are both even and m <n, or 

3. m and n are both odd and n < m. 

This yields 0^2-<4-<...^5-<3-<l. Ifm^« then we say that m is better than n, whereas terms like 
minimum and smaller refer to the standard relation < on N. We leave it up to the reader to verify that 
case [vn] of the definition of pm applies if and only if j ■<i,k < i and k < j. 
A play p is winning for Duplicator if and only if the set 

F :={SxV)x{SxV)x {/} 

is visited infinitely often; this means that is a Biichi condition. (Spoiler wins if and only if he can 
avoid / from a certain point onwards.) Wesay that (527^2) right-hand delayed simulates (i'l ,vi), denoted 
(■^1) vi) (■5'2, V2), if and only if Duplicator has a winning strategy in from the initial game position 
Pi{{^i^vi),{s2,V2)) defined as follows. Let / := c'{si,vi) and j := c'(s2,V2)- 



Pl((5l,Vi),(52,V2)) : = 



{{si,vi),{s2,V2),mm{i,j}) if i ^ j 
((■^i,vi),(52,V2),/) otherwise 



In HI it is shown that <^'^ is a preorder implying language containment, i.e., if (^ijVi) <^'^ {s2,V2) then 
L{-s^(si.vi}) ^ ^('2^'(s2,v2))- define the corresponding equivalence relation ^""Jl as 

i^hn) ^dl i^2,V2) ■ {si,Vi) -C'l {S2,V2) and (S2,V2) <''de (■5l,Vi). 

Duplicator's winning region Wdu in determines <''j],, and from that we can compute Note that 
we need to consider only the case where it holds vi = V2 (cf. Definition |4l). 



5.2 Quotienting 

The relation is compatible with a parity game automaton. Item [T] of Definition [3] is verified by the 
upcoming lemma. Item [2] of Definition [3] follows from the fact that quotienting with respect to Rij^ is 
language-preserving, as is shown in [4 |. 

Lemma 3. Let si he a parity game automaton and defined as in Section \5A\ Then, for all s\^S2 G 
5, vi , V2 , v' G V holds: 

(.l,Vi) (^2,V2) =^ 5((.i,Vi),v') 5((^2,V2),V') 

Remark 6. Let £/ he a. parity game automaton and the right-hand delayed simulation relation 
for s/. Then is compatible with s/. The quotient automaton £/ j is defined in the natural 

way: 5/^., ([(.,v)],v') := v), v')] and c'/,.. ([(., v)]) := min{c'(.',v') | (>,vO is 

de J , de de 

equivalent to m . We refer to ||41 for the details. 
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We compute from ^^^J^ (as in Definition |4l) and express the acceptance condition y/^s °f /~s 
in terms of a coloring c' /~^. To this end, let s € 5,v € V and define 

c7.,([.],v) :=min{c'(/,v') | (/,v') (^,v)}, 

and let i ^sink inherit their color from Since si^s^i implies (^i , v) ss^^ (^2 , v) for all v G V, the above 
definition of c'/~^. is independent of representatives. Note that is/ /-^ is a game automaton. Essentially, 
the relation is a refinement of Hence, the automaton /-^ is equivalent to £/. 

Lemma 4. Let be a parity game automaton and / the corresponding ^.^-quotient with coloring 
c'/«j (see above). Then si and si j ~^ are equivalent. 

Proof. We have to show L{si) = L{si / ~ ), where it suffices to show L{.si / ^,h) = L{si / ~ ). By 
Lemma [3] automaton £^ j r^rh is deterministic, and by Definition [5] automaton s^/^c is deterministic. 
For a G V®, let p be the run of si /- on a and p' be the corresponding run of si' /^h, on a. The run p' 
is uniquely determined by the run p, because both si /^h, and si /-., are deterministic and is a refine- 

ment of Ri^'^. Moreover, p is accepting if and only if p' is accepting, because both runs have the same 
sequence of colors. Since both si /- and si /^,;, are deterministic, there is no other run on a, neither 
for si / ~ nor for si / . Thus, a is accepted by si / ~ if and only if it is accepted by s^ / ^,-1, . □ 

^de ^de 

Our above results show that our algorithm for memory reduction is applicable to a Streett game T as 
follows: We simulate F by a parity game F' which is then transformed into a parity game automaton si . 
For si we construct the right-hand delayed simulation game ^J^' and solve it by standard techniques 10. 
Duplicator's winning region in this game and Definition |4] uniquely determine ^5. The corresponding 
quotient automaton si /-^ is a parity game automaton equivalent to si, and we can transform it into a 
unique parity automaton game F". By Theorem [T] F is simulated by F". 

Corollary 2. Let T be a Streett game and F' the corresponding parity game (cf. Remark\5]). Further, let 
si be the game automaton ofV and defined as above. Then F is simulated by the automaton game 
T"ofsi/„, 

This yields the following algorithm. 

Algorithm 3. (MEMORY REDUCTION FOR Streett games) 

Input: Streett game F = (G,n) 

Output: Strategy automaton sif for Player from Wq 

1. Establish a game simulation of F by a parity game F' (cf. Remark|5]). 

2. View F' as parity game automaton (cf. Definition |2l); redefine the coloring of si asc' :=2k — c'. 

3. Construct the delayed simulation game for si and solve it. From Duplicator's winning region 
compute (cf- Definition |4l). 

4. View si /-^ as parity automaton game F" (cf. Definition |2ll. 

5. Compute a positional winning strategy for Player in F" and from it construct sif. 

At this point we have to mention an optional normalization, which may make the relation larger. 
It is done before executing step [3] For each SCC C of si we iterate the following: While there exists 
{s,v) G C such that c'{s,v) > 2 and there exists no {s',v') G C such that c'{s' ,v') = c'{s,v) — 1 do c'{s,v) := 
c'{s,v) — 2. Clearly, this does not change the accepted language 141 . 

^Note that F" is a min-parity game. 
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For the computation of r^^'^ we need to solve the simulation game ^J^. It is a Biichi game of size 
0'{r^ ■ k) where r is the number of states of and k is the number of colors assigned by c' . Since Biichi 
games are solvable in polynomial time measured in the size of the game graph, the overall running time 
of Algorithm |3]is polynomial in |r'[. 

Note that the above technique can analogously be applied to Muller games (see 0). The only 
difference is the game simulation in step [T] Muller games can also be simulated by parity games, but 
the needed memory depends on the number of vertices of the game graph G. Accordingly, we need to 
redefine c' := 2\V\ — c' at the end of stepE] The rest of the algorithm is the same as for Streett games, 
and we obtain a similar running time. 



5.3 An Example for Streett Games 

Let us show a result for the class of strong winning conditions, similar to the one above. We consider 
the Streett game in Example [T] (see below) and make particular assumptions on the winning strategy for 
Player in the simulating parity game. More precisely, we demand that she behaves "optimal". This is 
meant in the sense that she continuously chooses those edges which globally guarantee the best colors 
she can enforce. (A color m is better than a color nifn ^m, where -< is the reward order from page l56l ) 
Example 1. Let Gk be the graph shown in Figure |3] (for k = 3), and Q-j^ the following Streett winning 
condition: 

Il, = {(£i,Fi),(£_i,F_i),...,(£,,F,),(£_,,F_,),(y,y)} 



£-2,^2 £-3,^3 EuF^i £2,^-2 £3,^-3 yi-Ei,E^ 



\ / \ / 



Vl 



V2 



V3 



\ / \ / \ 



(Wj) (W2) 



3) X 



Ei,F^i £'2,F_2 -£'31^-3 E^i,Fi £'_2,^2 E^t,,F^ 



Figure 3: Streett Game Graph G3 



The game proceeds similarly to the one from Figure |2] The major difference is that vertex vi is 
visited infinitely often, naturally dividing each play into rounds. At the end of each round, i.e., when the 
play proceeds from vertex y to vertex vi , the highest possible color 4^ + 2 is seen in the parity game r[ 
(simulating Fjt). This is due to the fact that some index must be at the last position of the current lAR 
and, accordingly, the pointer e' has the value 2k + \ (cf. page [55] ). Thus, each play satisfies the parity 
winning condition, and each (positional) strategy for Player is winning. 

Theorem 3. Let = {Gk,^k) be the Streett game from Example\J\and let r[ = {G'i^,Ck) be the parity 
game simulating (according to Remark\5^, where sq := ((1 ■ • • 2^ + 1), 1, 1) is the initial memory 
content. Then, Player wins T^from vertex v\ such that the following hold: 
1. Each positional winning strategy f'^for Player in T'j^from {so,vi ) with 

{ckis',v') I {{s,v),{s' ,v')) £ Ef^}n{2n+ \ \n£N} = 

^Efi_ denotes the set of all edges determined by the positional strategy 
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yields a winning strategy fkfar Player in r^from v\ of size at least 2^. 
2. The reduced game graph G'l. computed by Algorithm\3}has only one memory content. 

Proof. For simplicity, we assume k = 3; the proof is analogous for other values of k. First, we fix a 
convention on the entries in an lAR: Let the pair {V,V) be represented by V and every other pair by 
its unique index, i.e., (£'_2,^-2) is represented by —2; if index / has value —j, then let — / := j, for 
1 < 7 < 3. 

The intersection in item[T]means that Player chooses only edges leading into vertices of even color. 
To prove that item, note that the permutation reached (in when vertex w\ is reached (in Ft,) is of the 
form 

{Vi3---iip), 

where ij € {j, —j} (for 1 < 7 < 3), and p is some permutation of the set {—ii , —12, —13}. For example, let 
Player 1 at vi decide to move up, then again up, and then down; then the permutation is (V 3 —2 — 1 p), 
because the indices —1,-2 and 3 are shifted to the second position one after another, and V stays at the 
front (cf. proof of Remark|5]l. Moreover, /? is a permutation of {1,2, —3}, i.e., of the set of all indices / 
for which Player 1 has moved to F,, recently. 

If Player moves upwards at w\, i.e., she mimics Player I's behavior at vertex vi, then the permuta- 
tion shortly becomes (V 1 3 — 2 — 1 p'), and p' is either (2 —3) or (—3 2). That means e' is assigned the 
value 5, 6 or 70 Simultaneously, /' gets the value 5 because F_i is visited and —1 is at the fifth position 
in the new permutation. Accordingly, it holds e' > f, which means that we see color 10, 12 or 14. 

Conversely, if Player moves downwards at wi , then the permutation becomes {V — I 3 —2 p"), with 
p" = p. Hence, e' is assigned 4 because index — 1 comes from the fourth position. Moreover, index 1 is 
located somewhere in p", i.e., at position 5, 6 or 7. Thus, we have /' > 5 > 4 = e'; accordingly, we see 
an odd color, either 9, 11 or 13. 

Making an analogous observation at vertices W2,W3, we can deduce the following: If Player mimics 
Player I's behavior then she visits an even color, say /; if she makes the "wrong" move then a vertex 
of odd color less than / is seen. Let the latter situation be called an error and note that Player can 
play errorless by memorizing Player I's decisions. By an argument analogous to that in the proof of 
Theorem |2] implementation of an errorless winning strategy requires a memory of size at least 2*^. 

To see item|2l let us consider Algorithm |3] First, note that the coloring ct is redefined as ct '■= 
4k + 2 — Ck (in step ID; accordingly, we are dealing with a min-parity condition from now on. Every 
play on G[ must traverse an edge ((^'1,3'), (^2,^)) infinitely often, for some lARs si,S2. Thereby, a vertex 
with the smallest possible color is visited, because there must be 1 < j < ^ such that index j or —j 
is at the last position of the lAR si. Thus, in the simulation game the priority memory is reset to / 
and a final vertex is visited, infinitely often. Accordingly, Duplicator has a winning strategy from each 
vertex in the simulation game graph. Summing up, all states (having the same V-component) in the 
parity game automaton £/ of r[ are ^^'^-equivalent, which means that all memory contents are declared 
Ri^-equivalent. Thus, we obtain a reduced memory of size one. □ 

6 Conclusion 

We have presented a method that reduces the memory for implementing winning strategies in Request- 
Response and Streett games. The key idea is to view the result of a game simulation as an ou-automaton 

^Note that the definition of e' refers to the old permutation, and index 1 came from position 5, 6 or 7 (cf. pagelSSt. 
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whose state space contains the memory to solve the given game. This state space is reduced via the 
notion of delayed simulation (cf. Em). The reduction is carried out only on the set of memory contents, 
where two memory contents are considered equivalent if, from them. Player wins exactly the same 
plays. In our setting, delayed simulation can be computed in time ^(n • (log n)^) and ff{n^ ■ k) for Bilchi 
and parity game automata, respectively, where n is the number of states of the game automaton and k 
the number of colors (in the parity game automaton). In both cases our algorithm has a running time 
polynomial in the size of the simulating game T' . 
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